Handling Imprecision & Incompleteness in Autonomous Databases
Abstract
As more information from autonomous web databases becomes available to lay users, query processing over these databases must adapt to the imprecise nature of user queries as well as to incompleteness caused by missing attribute values (aka “null values”) in the database. In such scenarios, the query processor begins to take on the role of a recommender system. Specifically, in addition to presenting answers that satisfy the user’s query, the query processor is expected to provide highly relevant answers even when they do not exactly satisfy the query predicates. This broadened view of query processing poses several technical challenges. We propose a decision-theoretic model that, in the presence of imprecision and incompleteness, ranks answers in order of their expected relevance to the user. The model combines a relevance function, which reflects the relevance a user would associate with answer tuples, and a density function, which reflects each tuple’s distribution of missing data. Adopting this model foregrounds three general challenges: (i) how to assess the relevance and density functions automatically, (ii) how to support efficient query processing to retrieve relevant tuples, and (iii) how to make users trust the recommended answers. We present a general framework for addressing these challenges, describe a preliminary implementation, and discuss the results of a preliminary empirical evaluation.
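The ranking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a query on a single attribute, a relevance function that scores a concrete attribute value, and a density function that returns a probability distribution over the possible values of a missing attribute. The names `expected_relevance`, `rank`, `toy_relevance`, and `toy_density`, and all numbers in the toy data, are hypothetical.

```python
from typing import Callable, Dict, List, Optional

Row = Dict[str, Optional[str]]


def expected_relevance(
    row: Row,
    attr: str,
    relevance: Callable[[str], float],
    density: Callable[[Row], Dict[str, float]],
) -> float:
    """Expected relevance of `row` for a query on the single attribute `attr`."""
    value = row.get(attr)
    if value is not None:
        # Value is present: relevance is known directly.
        return relevance(value)
    # Value is missing (null): weight the relevance of each possible
    # completion by its estimated probability.
    return sum(p * relevance(v) for v, p in density(row).items())


def rank(
    rows: List[Row],
    attr: str,
    relevance: Callable[[str], float],
    density: Callable[[Row], Dict[str, float]],
) -> List[Row]:
    """Sort rows by expected relevance, highest first."""
    return sorted(
        rows,
        key=lambda r: expected_relevance(r, attr, relevance, density),
        reverse=True,
    )


# Toy data (assumed, for illustration only): the user asks for model = "Civic".
def toy_relevance(model: str) -> float:
    # Hypothetical similarity of each model to the requested "Civic".
    return {"Civic": 1.0, "Accord": 0.7}.get(model, 0.0)


def toy_density(row: Row) -> Dict[str, float]:
    # Hypothetical distribution of the missing model, conditioned on the make.
    if row.get("make") == "Honda":
        return {"Civic": 0.6, "Accord": 0.3, "CR-V": 0.1}
    return {}


cars: List[Row] = [
    {"id": "d", "make": "Toyota", "model": "Camry"},
    {"id": "c", "make": "Honda", "model": "Accord"},
    {"id": "b", "make": "Honda", "model": None},
    {"id": "a", "make": "Honda", "model": "Civic"},
]

ranked = rank(cars, "model", toy_relevance, toy_density)
print([car["id"] for car in ranked])
```

In this toy setting the tuple with a missing model is ranked above the similar but inexact match, because its expected relevance accounts for the chance that the missing value is the one requested.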
Similar resources
Elicitation, Estimation & Explanation Challenges in Handling Imprecision & Incompleteness in Autonomous Databases (Position Paper for Penn II Workshop) Topic: Imprecision and uncertainty in data and inferences
We will motivate the problem of simultaneously handling incompleteness and imprecision in autonomous databases. We will argue that effectively tackling this problem requires solutions to density and relevance estimation, query rewriting and result explanation. We will show that solving these problems requires tools from decision theory, utility elicitation, statistical learning, as well as core...
QUIC: Handling Query Imprecision & Data Incompleteness in Autonomous Databases
As more and more information from autonomous databases becomes available to lay users, query processing over these databases must adapt to deal with the imprecise nature of user queries as well as incompleteness in the data due to missing attribute values (aka “null values”). In such scenarios, the query processor begins to acquire the role of a recommender system. Specifically, in addition to ...
QUIC: A System for Handling Imprecision & Incompleteness in Autonomous Databases (Demo)
As more and more information from autonomous databases becomes available to lay users, query processing over these databases must adapt to deal with the imprecise nature of user queries as well as incompleteness in the data due to missing attribute values (aka “null values”). In such scenarios, the query processor begins to acquire the role of a recommender system. Specifically, in addition to ...
A Novel Type-2 Adaptive Neuro Fuzzy Inference System Classifier for Modelling Uncertainty in Prediction of Air Pollution Disaster (RESEARCH NOTE)
Type-2 fuzzy set theory is one of the most powerful tools for dealing with the uncertainty and imperfection in dynamic and complex environments. The applications of type-2 fuzzy sets and soft computing methods are rapidly emerging in the ecological fields such as air pollution and weather prediction. The air pollution problem is a major public health problem in many cities of the world. Predict...
An Investigation of the Cost and Accuracy Tradeoffs of Supplanting AFDs with Bayes Network in Query Processing in the Presence of Incompleteness in Autonomous Databases
As the information available to lay users through autonomous data sources continues to increase, mediators become important to ensure that the wealth of information available is tapped effectively. A key challenge that these information mediators need to handle is the varying levels of incompleteness in the underlying databases in terms of missing attribute values. Existing approaches such as Q...
Publication date: 2006